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Abstract 


A  formal  development  of  planning  systems  is  constructed  in  this  paper. 
The  problem  solving  system  is  developed  in  the  framework  of  a  modal  logic  and 
the  concept  of  weakest  precondition  is  introduced  to  guide  the  solution 
procedure.  The  procedure  is  a  generalization  of  several  well  known  problem 
solving  algorithms.  Moreover,  there  may  be  some  gains  in  practicality  of  the 
algorithm  resulting  from  this  extension,  which  is  illustrated  by  an  example. 


Introduction 

In  this  paper,  we  consider  formalizing  problem  solvers  for  predicate 
calculus  expressions.  Many  of  the  problem  solving  systems  in  the  literature, 
while  often  ingenious,  are  essentially  variations  on  the  Resolution  Principle  [5 
In  an  ad  hoc  way,  most  of  these  have  introduced  operations  over  predicate 
calculus  models,  and  devised  some  method  of  machine  search  over  the  operators. 

It  is  precisely  these  features  of  the  literature  that  are  dealt  with  in  this 
paper. 

Our  first  action  is  to  demonstrate  that  STRIPS  [2]  programs  are 
examples  of  programs  in  the  formal  language  Dynamic  Logic  [  3],  This  is 
a  useful  feature  in  itself  as  it  allows  us  to  examine  STRIPS  with  the 
analytical  tools  a  formal  language  provides.  We  then  can  observe  that  our 
operators  are  not  limited  to  STRIPS  type  operators.  A  much  larger  class  of 
operators  becomes  available  via  this  formalization.  These  operators  may 
simulate  actions,  as  in  STRIPS,  or  may  only  be  calculations,  such  as 
multipl ication . 

An  important  advantage  of  a  formal  theory  of  operators  is  the 
analytical  tools  available.  In  this  paper  we  focus  on  Dijkstra's  [  1] 
weakest  preconditions  as  an  excellent  example  of  operator  analysis.  These 
weakest  preconditions  permit  us  to  evaluate  in  advance  whether  application  of 
an  operator  will  achieve  a  goal.  Because  this  is  a  formal  criteria,  it 
is  naturally  embeddable  into  a  machine  for  automated  decisions.  Another 
useful  feature  of  r>e  formal  analysis  is  the  ability  to  prove  results  about 
the  system.  In  particular,  it  was  possible  that  the  STRIPS  approach, 
involving  only  concatenations  of  operators  and  not  allowing  for  unions  or 
iterations,  was  a  restriction  of  a  system  involving  these  latter  operations. 

In  this  paper,  we  prove  that  concatenations  of  operators  is  sufficent  for 


r 

solving  STRIPS  type  problems.  The  evaluation  function  £  2 ]  of  STRIPS  is 

an  ad  hoc  method  of  determining  whether  or  not  a  given  operator  is  appropriate. 

We  formalize  this  by  defining  a  measure  of  distance,  called  a  pseudometric, 
over  the  space  of  predicate  calculus  models.  In  most  cases  this  allows  for 
increased  efficiency  in  the  decision  process.  Since  this  simulates  the 
approach  of  a  human  decision  maker,  it  is  a  useful  tool  in  mechanized  problem 
solvers.  Humans  have  some  notion  of  what  models  are  close  to  one  another,  and 
formalizing  this  may  allow  the  machine  a  similar  faculty.  Moreover,  pseudometrics 
have  been  implicitly  assumed  in  many  of  the  extant  problem  solvers. 

There  is  some  advantage  to  their  explicit  use,  as  shown  in  McAfee  and  Whinston  [4]. 

II.  .1  The  Guarded  Commands  Language 

In  order  to  extend  and  formalize  STRIPS,  we  may  embed  STRIPS  inside  a 
larger  formal  language.  Although  STRIPS  operates  within  the  context  of 
predicate  calculus,  STRIPS  is  essentially  beyond  predicate  calculus, 
because  its  operators  are  not  tautological  proof  procedures. 

STRIPS  operators  are  modal  operators,  i.e.  their  application  reflects  a 
change  of  states.  For  this  reason  it  is  sensible  to  embed  STRIPS  in  a 
language  capable  of  expressing  facts  abouts  state  changes.  As  we  shall  see, 
a  subset  of  Dynamic  Logic  T3}  called  the  Guarded  Commands  language  £l}  makes 
an  excellent  choice  for  such  a  language. 

The  guarded  commands  language  is  a  strict  extension  of  predicate 
calculus,  and  the  set  of  terms  is  identical  in  both. 

The  set  of  programs,  PGC,  for  Programs  in  the  Guarded  Commands  language, 
are  defined  inductively  as  follows.  We  have  restricted  the  origional 
formulation  to  deterministic  programs.  Let  e  be  any  term,  P  and  R 
be  predicate  calculus  formulas,  and  x  a  variable.  Then 

1)  ABORT  €  PGC 

2)  x  *•  e  €  PGC,  (assignment) 

3)  for  a,  B,  £  PGC,  s;  B  g  PGC  (concatenation) 

4)  for  iy ,  fl  €  PGC  and  predicate  calculus  expression  P, 

P-;  a  .U"JP?;S  €  PGC  (program  union) 


5)  (P? ;  «)*!”!  P?  €  PGC  (unbounded  iteration) 

This  is  actually  a  subset  of  Dijkstra's  original  program  set,  limited  to 
deterministic  programs.  Determinism  is  a  useful  assumption  that  will  cost 
little  ,  if  any,  generality. 

The  programs  are  intended  to  be  interpreted  with  variables  ranging 
over  the  real  numbers  and  the  standard  meaning  of  any  functions  and 
constants  necessary,  e.g.  +,  .  ,  0,  1.  We  assume  there  is  a  well  defined 
interpretation  of  predicate  symbols.  The  predicate  calculus  quantifiers 
and  connectives  are  all  given  their  standard  interpretations.  The  semantics 
of  the  program  induction  are  given  as  follows: 

1)  Abort 

2)  Assign  term  e  to  variable  x 

3)  do  a  then  B 

4)  If  P  do  (y  else  8 

5)  While  P  do  x 

Observe  that  do  loops  may  be  constructed  using  iteration: 

x  ♦-  o;  (x  <  n?;<y;  x  ♦-  x  +1)  *  ;  x  >  n? 

is  a  program  which  executes  at  for  x  *  0, ,  .  .  n 

For  a  more  specific  example,  consider  the  PGC  program 

i  1;  j**0;  (i  £  n? ;  j  j  +m;  i  «-  i  +1)  ;  i  >  n)? 

This  program  adds  m  to  j  n  times,  that  is,  it  calculates  ran.  Actually, 
in  the  STRIPS  environment,  we  need  only  begin  with  a  finite  set  X  of 
programs  (operators)  from  which  we  may  compose  new  programs.  This  forms 
the  set  of  Elementary  Programs  E  ,  which  is  defined  inductively  below: 

1)  X  £  E 

2)  o',  B,  €  E  then  or,  B€  E 

3)  at,  8,  €  E  and  P  a  first  order  statement,  then  (P?;  ty  U  IP?;  B)  €  E 

4)  a  €  E  and  P  a  first  order  test  then  (P?;or)  *;  TP  €  E 
The  semantics  are  the  same  as  in  PGC. 


Given  a  set  of  programs  E  ,  a  sub- language  of  DL,  called  PDDL-GC,  is  defined 
similar  to  the  predicate  calculus  induction: 

1)  any  predicate  calculus  expression  is  in  DL 

2)  for  a  €  E  >  and  DL  expressions  P,  R,  then  "jP,  PvR  (3x)  P  and  <^y>  P  €  DL 
The  semantics  of  these  expressions  is  the  same  as  in  predicate  calculus 
except  for  <  a  >  which  is  undefined  in  predicate  calculus.  <  a  >  P  is  inter¬ 
preted  to  mean  "a  terminates  with  P  true",  that  is,  after  operating  <y,  P 
will  be  true. 

If  (v  corresponds  to  a  STRIPS  operator,  then  P  ->  <  o'  >  R  means  that  if 
P  is  true,  then  operation  of  a  will  make  R  true.  It  is  very  useful  to 
have  such  expressions  formally  expressible,  as  they  may  be  very  useful  in 
guiding  a  STRIPS  search.  Through  such  expressions,  we  may  explicitly 
represent  information  on  how  to  procede  to  make  R  true.  This  allows  us 
to  replace  STRIPS'  "evaluation  function"  [ 2  ]  with  an  organized,  formally 
stated  and  automatable  body  of  knowledge  about  what  works  to  prove  goals. 

However  Dijkstra's  formalism  possesses  considerably  more  power. 

II.  .2  The  Weakest  Precondition 

The  Guarded  Commands  Language  was  designed  to  make  program  verification 
possible  and  simple.  Dijkstra's  original  purpose  was  to  establish  conditions 
under  which  a  program  cy  will  terminate  with  a  predicate  calculus  expression 
R  true,  i.e.,  what  implies  <  a  >  Rl  He  demonstrated  that  in  the  domain  of 
GC,  and  hence  all  subprogram  sets,  there  is  a  weakest  precondition  (wp)  expressed 
in  infinitary  predicate  calculus  so  that  wp(cy,  R)  =  <  a  >  R.  Thus  we  may  embed  a 
fragment  of  the  program  verification  problem  inside  predicate  calculus,  where 
we  have  the  Resolution  Principle  [5]  and  other  first  order  theorem  provers 
at  our  disposal.  Dijkstra  provides  these  wp  expressions  via  the  induction 
below,  although  our  notation  remains  that  of  Harel  [3]. 


wp  (abort,  R)  ~  false 

wp  (x  -  e,  R)  RC  Where  R®  is  R  with  x's  replaced  by  e's  and  all  bound 

variables  of  R  appearing  in  e  are  renamed. 

wp  (a;  8,  R)  r  wp  (iy,  wp  (p. ,  R)) 

wp  (p? :  <v  1 1  IP?;  B.  R)  (P  -*  wf  (o'.  K>>  a  (IP  -*  wp  (p,  ><>> 
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II. 


wp  ( (P? ;  cv)*;  “1 P? ,  R)  =  V  H  where  H  =  IP  a  R 

n-o  n  o 

and  H  s  P  A  wp(ar,  H  ) 
n+1  n 

Observing  that  corresponds  to  the  condition  implying  iteration  of  the  express¬ 
ion^?;  cd  n  times  makes  R  true,  one  may  quickly  establish  that  all  of  these  equivalences 
hold.  These  equivalences  provide  a  method  of  constructing  the  weakest 
precondition  implying  for  a  €  PGC,  <  a  >  R,  for  first  order  R. 

In  general  we  cannot  expect  the  weakest  precondition  of  programs 

involving  iteration  to  have  predicate  calculus  equivalents.  V  H  is 

n 

n=0 

an  expression  in  an  infinitary  version  of  predicate  calculus,  and  outside 
of  ordinary  predicate  calculus.  The  formulation  of  the  weakest  preconditions 
demonstrates  that  for  any  program  a  without  iteration,  <  a  >  P  has  a 
predicate  calculus  equivalent.  We  will  argue  that  for  a  problem  solving 
environment  where  the  initial  state  is  given  and  fixed,  iteration  is 
unnecessary.  In  this  way  we  will  avoid  the  need  to  work  in  an  infinitary 
environment.  Clearly,  for  many  types  of  program  analysis,  it  is  necessary 
to  include  iteration. 

2.3  STRIPS  Operators 

The  operators  of  STRIPS  change  world  states  in  an  organized  fashion.  Any  STRIPS 

operator  a  is  given  by  an  ordered  triple  of  sets  of  predicate  calculus  expressions 

<  {  P  .  ,  P  ],  {A  , .  .  .,  A  },  fD  , .  .  . ,  D  }  >.  The  application  of  a  has 
X  lc  X  x  n 

a  precondition,  which  is  that  P  =  P^  A  ?2  A.  •  •  A  holds.  If  P  is  true, 

then  operation  of  a  results  in  a  new  world  state  where  the  expression 

A  =  A.  a.  .  .  A  A  is  asserted  and  D  , .  .  .  ,  D  are  no  longer  asserted. 

I  m  in 

STRIPS  presumes  that  nothing  else  is  changed  in  the  world  state  transition, 
except  of  course  that  new  implications  may  be  derived. 

To  model  STRIPS,  we  must  embed  the  changes  in  the  truth  values  of 
expressions  into  changes  of  variables.  This  observation  reveals  a  direct  method  of 
such  an  embedding,  that  is,  provide  for  variables  whose  values  are  truth  values  of 


expressions. 


Thus,  for  each  predicate  calculus  expression  E,  we  define  a  variable 
whose  semantic  interpretation  is 
yg  =  1  indicates  E  is  asserted  true 
Ug  =  -1  indicates  E  is  asserted  false 
Ug  £  {l>  -  1}  indicates  neither  E  nor  "IE  is  asserted. 

We  may  inductively  build  on  this  definition 


i) 

ii) 


^  IE  * 

^E  V  F 


P'E  =  1  or  UF  = 
UE  =  -1  and  !*F 


0  otherwise 


1 


iii) 


1  if  3x  Ug  (x)  =  1 


axE 


J  -1  if~l(3x)  (x)  =  1 
o  otherwise 


Ug  =  x  is  notation  for  EQUAL  x)  ,  the  equality  predicate.  We  also  establish 

a  rule  of  precedence:  If  £  (1,  -1}  and  it  is  proved  that  =  1 ,  then 
Ug  1  .  This  rule  of  precedence  should  be  used  to  adjust  for 

new  information  after  any  theorem  proving,  that  is,  after  proving  an  expression  E, 
make  sure  to  assign  Ug  «-  1-  The  STRIPS  program  a  is  very  easy  to  incorporate 
into  DL  with  this  formalism.  Let 


a  s  P?  |iA  1;  up  ,  *•  0 ;  .  .  .  uDn  «-  0  u"1*’?  ABORT. 

Then  <y  tests  whether  the  precondition  P  holds,  if  so,  it  makes 

A  =  A,  a.  .  a  A  true  and  makes  D, ,  .  .  . ,  D  unasserted.  Otherwise  it 
1  n  1  n 

fails. 

A  STRIPS  system  is  merely  a  collection  of  operators  of  STRIPS  type.  In 
the  notation  of  section  2.1,  we  let  X  =  {STRIPS  operators  written  in  program 
form}.  The  set  of  admissable  programs  is  then  E. 

Thus  we  may  embed  STRIPS  operators  into  the  GC  formalism.  A  STRIPS 


problem  consists  of  an  initial  state,  expressed  in  predicate  calculus, 
and  a  goal.  The  goal  is  a  predicate  calculus  expression  R.  The  intent  of 
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STRIPS  is  to  find  a  sequence  of  operators  n,.  .  . ,a  so  that  operation  of  this 

1  m 

sequence  results  in  R  being  true.  This  is  converted  into  GC  by  first 
observing  that  a  predicate  calculus  initial  model  is  an  assignment  of  terms 
to  the  p,  variables.  Hie  goal  is  the  same  expression  R.  The  problem 
remains  to  find  any  sequence  of  programs  (operators)  where  wp  (0^;.  .  .  ;g^,  R) 
is  truein  the  initial  state. 

In  section  3,  we  shall  develop  the  mathematics  of  GC  to  suit  our 
program  synthesis,  and  not  program  verification,  needs.  This  will  result  in 
the  construction  of  a  GC  problem  solver.  In  Section  4,  we  shall  return  to 
STRIPS  for  a  comparison. 

Ill  .  1  Some  Mathematical  Results. 

In  order  to  show  that  the  planning  system  in  the  following  section 
operates  in  the  manner  intended,  we  need  some  analysis  of  PDDL-GC.  Although 
many  of  the  sections  results  apply  to  other  varieties  of  DL,  we  will  assume 
PDDL-GC  throughout. 

Lemma  1  •  Let  a,  0  be  GC  programs  and  P  =  wp  (o',  R) .  Then  if  there  exists 
a  first  order  test  Q?  so  that  wp  ( (Q ? ;  o'  IJ  “|Q;  g),R)is  true,  we  have 
wp  ((  p5fY  U  HP?;  0).R)  is  true. 

Proof:  wp  ((  Q? ;  <*  ij  "IQ?;  g),R)  =  Q  -*  wp  (a,  R)  A  ~iQ  ->  wp  (0,  R)  was  previously 

noted.  This  implies  Q  -»  P,  and  by  contrapositive tP  -♦  ~\Q,  Therefore 

P  b  wp  (a,  R)  -♦  wp  (r y,  R)  and  nP  -*  iQ  -»  wp  (0,  R) .  Consequently 

wp ( (  P?;  <y  tJ  IP?;  0)R)  =  P  -*  wp  (<*,  R)  A  AP  -*  wp  (B,  R)  is  true,  as  desired. 

Lemma  2:  (Dijkstra  [  1  ])wp  (<y,  P  v  Q)  s  wp  (os  P  )  V  wp(a,Q) 

wp  (a,  P  A  Q)  ;wp  (o,P  )  A  wp(a,Q) 

Lemma  3:  LetP  *  wp  (<y,  R) .  Then  wp  ( P? ;  a  U  "1p? ;  0,  R)  e  wp  (a,  R)  V  wp  (g,  R) 

Proof:  wp  (P? ;  a  IJHP?;  0,  R)  =  P  -♦  wp  ( a ,  R)  a  HP  -*  wp  (g,  R) 

e  wp  ( a ,  R)  -*  wp  (&,  R)  Aiwp  (cv *  R)  -»  wp  (g,  R) 
s-\wp  (o,  R)  -»  wp  (fi,  R) 
e  wp  (a,  R)  V  wp  (0,  R) 
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3  demonstrates  that,  under  the  hypothesis  P  =  wp  (® ,  R) , 

U  functions  exactly  like  boolean  disjunction. 

Lenw  j  4:  Let  S  *  wp(cy,  P)  and  suppose  or  terminates.  1 
Then  wp(tf;(P?;e  R)  =  wp(S?;a;0  Ul  S?or;v,R) 

Proof:  wp(ff;(P?;0  Ul??;v),  R)  *  wp(<*,wp(P?  ;B 
wp (a,  P-*wp(B,R)  AlP  -*wp(v,  R))  s 
wp  (a,  P  -*  wp(B,R))  A  wp  (o' ,  TP  wp(v,  R) )  = 
wp  (a,  IP  V  wp(B,  R))  A  wp(cv,  P  V  wp(y,  R))  ~ 
wp(cr,  IP)  Vwp(o;0,  R)  A  wp(a;P)  vwpfev,  R)  = 

7S  v  wp(a*,B,  R)  A  Sv  wp(ff5V)  R)  s 
S  -♦  wp(a;B,  R)  a~1S  -»  wpCcr.Yi  R)  - 
wp(S?;  a;g  U  lS?;a;v,  R) 

Definition  5:  6?;*  U  OSaT)?;VU  (nS  AlT)?;p)  s  S?;ccUlS?;  (T?;v  UlT?;g) 

This  rule  may  be  used  recursively  to  identify  the  meaning  of  a  union  of 

n  programs  P, ? ;  a,  U  P  ?;  a-  U-  «  .  U  P  ?;  a  •  Also  note  that  then 
L  i  Z  z  n  n 

restriction  on  such  unions  is  exactly  i)  P  v  P„  V  .  .  .V  P  is  a 

1  Z  n 

tautology  and  ii)  The  P^'sare  pairwise  inconsistent.  Any  finite  union 
satisfying  i)  and  ii)  is  a  PDDL-GC  program. 

Lemma  6: 

i]  wp((P?;a  U1P?;8);(R?;v  u  1R?;6),  S)  = 
wp  ( (  Pa  wp  (a,  R) )  •  ;o,;v  u  (Pa  wpfo,  iR))?;a';6  U 
(1PA  wp(ffl  R))?;8;v  U  ("I ?a  wP(p,  n  R))?;d;6  ,  s) 
ii]  wP(P?;(R?;ff  U1R?;B)  UlP?;(T?;vU  1T?;6  ,  S)  - 
wp(PAR?  ;a  u  Pa  1R?;B  (IIPaTTjy  U!Pa^?;A  ,  s). 

The  proof  of  lemma  6  is  straightforward  • 

Lemma  7i  Any  PDDL-GC  program  formed  by  composition  of  base  programs  using  U 
and  ;  (without  iteration)  may  be  equivalently  expressed  in  the  disjunctive 

lwp(cr,lP)  5lwp(o,  P)  if  and  only  if  a  terminates.  See  Dijkstra  [1]. 
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normal  form  P^jc^  U  P2?;  a2  U’  '  ,lj  Pn?  ;oV  where  01  i  is  3  concatenation 
®  1 » »  *  '  *  * »  Pm^  of  base  programs,  and  is  as  in  Definition  5. 

Proof:  Using  lemma  4,  we  may  drive  base  programs  inside  unions.  Using 
6  and  5,  we  may  rewrite  unions  concatenated  with  unions  and  unions  of  unions 
into  new  unions.  This  is  repeated  until  the  disjunctive  normal  form  is 
established.  The  new  program  is  equivalent,  at  least  in  terms  of  input/- 
output  behavior,  because  the  wp's  of  each  program  are  identical. 

This  result  is  obvious  in  the  sense  that,  by  expanding  the  computation 
tree  of  any  program  without  iteration,  there  are  only  finitely  many 
branches  and  thus  there  must  a  condition  P^  that  determines  whether  the 
computation  proceeds  down  the  i^  branch.  However,  in  the  process  of 
proving  lemma  7,  we  have  provided  an  algorithm  for  calculating  P. .  This 
is  where  the  usefulness  of  the  result  lies. 

Suppose  and  initial  state,  given  by  a  set  of  predicate  calculus  axioms, 
is  given  and  fixed.  Then  it  is  clear  that,  for  any  program  <*  if  <  a  >  Q 
holds,  then  any  iterations  operate  finitely  many  times  and  there  will  be 
a  sequence  of  base  programs  8^,.  .  .  ,0^  such  that  <  0^;.  .  . ;  0^  >  Q  is  true. 
This  yields: 

Observation  8:  In  the  STRIPS  environment,  where  initial  states  are  fixed, 
concatenation  of  base  programs  is  sufficient  to  create  a  solution,  if 
one  exists. 

Of  course,  iteration  and  union  of  base  programs  makes  an  interesting  topic 
for  A. I.  research,  but  the  STRIPS  search  method  is  sufficient.  Because  it 
is  in  general  undecidable  how  many  iterations  are  necessary  to  achieve  a 
goal,  this  creates  a  difficult  research  issue,  and  is  beyond  the  scope 
of  this  paper. 


r 
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‘Rieorems  analogous  to  Lemmas  1  and  3  arise  in  the  general  union  case. 

Lemma  9:  let  ^  -  wp  (o^,  R),  IPj  A  .  .  .a  A  wp  (c^,  R),  i-2..  .  n-1 

and  Pn  pj  A  •  •  •  A  ?n-l.  Then  if  there  are  first  order  expressions  Q, . Q 

X  n 

Such  thatwp(Q1?5  c^U  Q2?;  »2  U  •  •  •  U  Qn?;  a  n,R)  is  true,  then 

wp  ((P-?;  a  U*  •  .U  P  ?;»  ),  R)  is  true, 
j-i  n  n 

The  proof  is  analogous  to  the  proof  of  Lemma  1 
Lemma  10:  If  P^,  .  .  are  chosen  as  in  Lemma  9,  then 

WP  U*  •  •  pn?;  o'n,  R)  s'  wp  (o^,  R)  v  wp  (» 2>  R)  v.  .  .v  wp  R) 

Proof:  It  is  easily  demonstrated  that 

wp  (Pj?;  cy1  IJ.  .  .U  Pn?;  o^,  R)  s  Pj  -♦  wp  (<yj,  R)  a  P2  -*  wp  (a2>  R)  a.  •  •  APn-*  wp  (»  ,R), 

from  definition  5.  For  all  i  <  n,  we  have  ?t  -  IPj  A  **2  A.  .  .A  TP  A  wp  (<*  R) 

Therefore  for  i  <  n,  P^  -♦  wp  ( a R)  is  clearly  true,  and  thus 
WP  (Pj’;  a-[  U.  .  .U  PR • ;  Q,n>  R)  s  P^  -*  wp  (an,  R)  £  ip^  v  wp  (a  ,  R) 

*  7(lP1  AlP2  V  .  .  .a  lPn_1)  V  wp  (cyn,  R)  f  Pj  V  P2  V.  .  .v  Pn  l  V  wp  (cyn,  R). 

Observe,  for  sentences  A  and  B,  since  A  s  (AaB)  V  A 

A  V  (iAaB)  s  A  V  (AaBI  V  (iAaB)  s  A  V  (  (  A  V  -|A)  A  B)  s  AvB 

Letting  A  =  wp(<y,R)  and  B  ■  wp(«2>  R) ,  we  have 

v  P2  H  WP  (<Yj>  R)  V  (7  wp  (cy^,  R)  a  wp  (or2 ,  R)  )  ■  wp  (c^,  R)  v  wp  (c*2»  R) . 

Suppose  for  induction  that  Pj  V  P2  V.  .  .vPk  =  wp  (ay  R)  V.  .  .V  wp  (cy  R) 

This  holds  for  k  =  2.  let  A  =  Pj  v.  .  .V  Pk  and  B  =  wp  (»k+1»  R) 

Then  ?l  V..  .vPk+1  s  (P1  V.  .  .VPk)V  Pk+f<p1  V.  .  .V  Pfc)  V  A .  .  .A7PfcA  wp  («k+1>  R)> 

s  (P1  V.  ,  .  V  Pfc)  V((7(P1  V.  .  .V  Pfc)  A  wp  (tyk+1,  R))b 

(Pj  V.  .  .  V  Pfc)  v  wp  (afc+1,  R)  e  wp  (cy  R)  V.  .  .  V  wp  (cy  R)  v  wp  (a  ,  R) 

For  any  k  s  n  -  1.  Therefore  letting  k  =  n  -  1,  we  have 

Pj  V.  .  .v  Pn_j  «  wp  (cy  R)  V.  .  .  V  wp  (o  ,  R). 

Therefore  wp  (P  ?;  or,  U-  .  .U  P  ?;  a)  s  P,  v.  .  .V  P  .V  wp  ( a  ,  R) 

11  n  n  i  n-i  n 

s  wp  (or,  ,  R)  V.  •  .V  wp  (( y  ,  R)  as  desired, 
i  n 


Suppose  we  have  an  Initial  state  I,  a  first  order  goal  R,  and  some 
first  order  facts  in  our  knowledge  base  which  are  true  in  all  legal  states. 
Denote  the  last  set  F.  Facts  about  the  state  I  are  all  facts  concerning  the 
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assignment  of  terms  to  variables. 

Now  we  might  try  to  prove  R  from  F,  inserting  knowledge  of  assignments 
of  I.  There  is  an  algorithm,  the  Resolution  Principle , f  5},  specifically 
designed  for  doing  this.  The  Resolution  Principle  checks  a  first  order 
expression  R  against  a  body  of  facts  F  by  assuming  1R  A  F  and  seeing  if  this  is  incon¬ 
sistent.  If  7RaF  i»  inconsistent,  then  F  proves  R,  and  R  is  true  with  respect  to  F. 
Resolution  operates  with  two  parts,  one  a  theorem  prover  with  respect  to 
predicates,  the  other  a  unification  principle  that  assigns  terms  to  variables 
attempting  to  facilitate  the  theorem  proving.  In  the  DL  context,  unification 
changes  states.  Because  we  wish  to  control  our  states,  and  indeed  unification 
might  move  us  into  an  illegal  state,  we  will  operate  resolution  without  the 
unification  principle.  If  the  unification  principle  is  desired,  it  may  be 
added  as  a  base  program,  so  there  is  no  loss  of  generality  from  this  alteration. 

In  the  process  of  resolution,  we  will,  however,  allow  assignments  of  the  state 
I  to  variables  to  be  inserted  for  those  variables.  This  limited  unification, 
together  with  the  theorem  prover,  might  be  termed  resolution  with  respect  to 
the  state  I.  When  we  refer  to  resolution  in  a  DL  context,  we  will  always  mean 
resolution  with  I  -  unification. 

Now  in  attempting  to  prove  R  from  F  inside  I,  we  may  operate  resolution, 
attempting  to  prove  a  contradiction  from  TR  a  F  a  I,  where  I  denotes  both  the 
state,  and  the  first  order  characterization  of  that  state  (e.g.  if  x  *•  e  in  I, 
then  EQUAL  (x,e)  characterizes  this  assignment,  where  EQUAL  (x,y)  holds  if  and 
only  if  x  “  y) . 
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If  an  inconsistency  is  found,  the  problem  is  solved  and  we  need  not  apply 
programs,  as  R  is  already  true.  So  suppose  resolution  operates  for  a  long  time 
and  no  contradiction  is  uncovered.  Let  A  be  an  assertion  proved  by  resolution. 

In  the  next  section  we  shall  give  criteria  for  which  expression  A  to  choose. 

In  the  proof  of  A  using  resolution,  a  certain  number  of  facts  from  I  were  used. 

Let  this  set  be  ,  .  .  .,  B^.  These  are  called  the  state  depenents  facts  in 
the  proof  of  A.  Finally,  suppose  R  itself  is  a  subgoal,  R  =  wp  (o',  Q)  and 
O'  is  a  proposed  program  for  making  Q  true  (which  may  itself  be  a  subgoal).  So 
if  R  is  true,  then  operation  of  <y  makes  our  desired  result  Q  true. 

Lemma  It  Suppose  there  is  a  program  B  such  that  A  -»  wp  (p,  Q). 

Then  wp  ( (R? ;  a  I.)  "1R? ;  0),  Q)  is  true. 

Proof:  wp  ( (R? ;  n  U  1R? ;  8  ).Q)  =  3  wp  Gy,  Q)  V  wp  (0,  Q)  =  R  v  wp  (B,  Q) 

If  R  is  true,  then  R  V  wp  (p,  Q)  is  true.  If  1 R  is  true,  then  resolution 
proves  A  is  true,  which  by  hypothesis  implies  wp  (B,  Q). 

Lemma  12:  Suppose  there  is  a  program  B  so  that 
wp  (p,  R  V  (B,^  a.  .  .  A\AlA))  is  true.  Then 
wp  (fi;  o',  Q)  is  true. 

Proof:  wp  (0,  R  V  B^  a.  .  .A  B^  A  1  A)  =  wp  (B,  R)  V  wp  (p,  B1  a.  .A  Bfc  A"1A). 
is  assumed  true.  If  wp  (0,  R)  holds,  then  wp  (B;  O',  Q)  s  wp  (8,  wp  Gy,  Q)) 

=  wp  (B,  R)  is  true  that  is,  operation  of  0;  a  proves  Q. 

On  the  other  hand  assume  wp  (B,  B  a.  •  .A  B^  a  "I  A)  is  true.  He  know  that 
-,R  a  F  a  A.  .  .A  B  -*  A,  because  B^  .  .  . ,  Bfc  are  precisely  the  state 

dependent  facts  used  to  prove  A  in  the  resolution  operation.  Since  by 
hypothesis,  after  operation  of  8  we  have  B^  a.  .  .A  B^  AT  A,  and  because  F  is 
true  in  all  states,  we  cannot  have  TR.  Otherwise  we  would  have  (iR  A  F  A  Bj  A.  .  .A 
Bk)  a  nA  implying  Aa  TA.  As  a  result,  if  wp  (B,  Bj  A.  .  •  A  \  A  "7  A)  is  true, 
wp  (0,  R)  is  true,  which  as  we  saw,  implies  wp  (0;  o',  Q)- 


The  interpretation  of  Lemma  11  is  as  follows:  suppose  we  operate  resolution 
trying  to  prove  wp  (a,  Q)  is  true.  We  are  unable  to  prove  an  inconsistency, 
but  we  do  prove  an  assertion  A.  The  assertion  A  is  the  difference  between  R 
and  success.  Consequently  we  attempt  to  handle  this  difference  by  adding  a 
separate  case.  A  sufficient  condition,  as  we  saw,  for  handling  this  difference 
is  to  find  a  program  B  so  that  A  -♦  wp  (£,  Q) .  The  process  is  not  contingent 
upon  finding  such  a  (3,  however.  We  might  produce  a  candidate  V  for  B,  and  then 
extract  the  difference,  handle  this  case,  extract  a  new  difference,  etc. 

It  is  in  general  undecidable  if  this  method  converges.  Consequently, 
one  can  only  note  that,  if  there  is  a  solution,  and  a  backtracking  method 

A 

is  implemented  so  that  any  possible  concateration  of  base  programs  is 
eventually  examined,  then  the  procedure  will  terminate  with  a  solution.  If 
there  is  no  solution,  then  in  many  instances  this  will  be  undecidable. 
Undecidability  is  a  fact  that  we  must  live  with,  and  do  what  we  can.  Of 
course,  it  is  advisable  to  avoid  trying  to  decide  undecidable  issues. 

The  reader  should  also  note  that  there  is  no  problem  if  resolution  is 
stopped  too  early.  For  suppose  R  =  wp  (a,  Q)  is  true,  and  that  resolution  is 
stopped  before  an  inconsistency  arises.  Then  wp  ((  R? ;  a  U  lR? ;  B)»Q)  will  be 
true.  This  follows  as  R  is  true,  and  hence  R? ;  a  I  0  will  execute  (>»  and 

we  assumed  R  =  wp  (a,  Q)  is  true.  Thus  there  is  no  loss  from  terminating 
resolution  too  early. 

The  solution  method  inherent  in  Lemma  12  is  perhaps  more  appropriate,  as 
it  relies  on  the  concatenation  and  not  disjunction  of  programs.  As  we 
observed  in  Observation  8,  concatenation  is  a  more  promising  approach,  as  we 
can  always  solve  the  problem  via  concatenation  of  base  programs.  In  Lemma  12, 

A  is  again  viewed  as  the  difference  between  R  and  success.  This  time,  instead 


•of  attempting  to  handle  A  as  a  separate  case  (as  A  -»  wp  (0,  Q),  we  havelwp  (0,  Q)  -♦ 
*1A,  and  1A  implies  R  true)  instead,  we  attempt  to  find  a  program  0  to  negate  A. 

This  would  allow  us  to  complete  the  proof  of  R,  and  thus  have  wp  (0;  <*,  Q)  true. 

Again  we  may  note  that  even  if  resolution  is  terminated  before  it  would  have 
proved  a  contradiction,  there  is  no  loss  of  generality.  The  predicate  wp  (0,R)  is  true 
so  0  cannot  negate  R.  Consequently,  wp  (0;  a,  Q)  remains  true. 

III.  .2  A  solution  method 

The  basis  of  the  solution  method  will  be  replications  of  Lemma  12. 

We  initialize  the  problem  by  providing  a  goal  R  ,  an  initial  state  I,  a 

o 

set  of  base  programs,  and  a  set  of  first  order  truths,  F. 

Begin  the  solution  method  by  attempting  to  prove  Rq,  using  resolution  on  F 
and  state  facts  I.  If  Rq  is  proved,  then  it  is  currently  true  and  there  is 
no  sense  applying  programs.  If  Rq  is  true,  halt  with  this  fact.  Otherwise, 
extract  the  difference  between  R^  and  success  as  in  Lemma  12.  The  means  of 
producing  the  difference  A^  and  the  state  dependent  statements  B°,  .  .  ,,B° 

o 

will  be  discussed  later.  For  now,  assume  we  have  them.  Let 

Q  s  Rq  v  (  7Aq  A  B1  a  .  .  .A  B°  ),  we  select  a  program  »  ,  which  is  a  candidate 

o  1 

for  making  Q  true.  The  means  of  producing  this  program,  together  with  the 
mechanism  for  the  choice  of  A,  will  be  discussed  later.  For  now,  presume  that 
we  obtain  it.  Let  R^  =  wp  (q^,  Q°).  Observe  that  lemma  12  implies  that  if 
R^  is  true,  we  are  done. 

The  selection  of  R^  will  serve  as  initialization  for  a  general  induction 

to  show  that  the  solution  method  preserves  the  situation  that  R*  true  means  a  solution 

has  been  found.  Suppose  we  have  found  Rn  =  wp  (o  ,  Q°  *) ,  and  if  Rn  is 

n 

true,  the  problem  is  solved.  Operate  resolution  on  Rn  to  see  if  Rn  is  true. 

If  we  fail  to  prove  Rn,  extract  a  minimal  difference  A^  and  required  state 
dependent  statements  b",  .  .  . ,  b”  in  the  proof  of  A^.  Let 

Q  »  R  V  (  An  a  B"  a  .  .  -A  ),  and  choose  a  program  q.  +  1,  to  be  a  candidate 
n  +1  n 

for  making  Q  true.  Let  R  s  wp  Qn) •  Observe  that  lemma  12  Implies 

if  Rn+*  is  true,  then  operation  of  a  makes  Rn  true,  and  we  assumed  R°  true 

Iwl 
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meant  we  had  the  problem  solved.  As  a  result,  we  have  preserved  the  property 
that  R*  true  means  the  problem  is  solved.  Thus,  this  is  a  valid  recursion  on 
the  R*'s,  moreover,  if  we  reach  a  state  where  R*  is  true,  then  operation  of 
“i5  ®i-i»*  •  •!  »2  ’  ffl  makes  Rq  true,  our  goal.  Hence,  the 
desired  program  is  5  •  •  •  5  by  iteration  of  Lemma  12. 

This  is  the  basis  of  the  decision  procedure.  What  is  left  is  the  formulation  of 
the  choices  of  and  • 

There  is  a  natural  procedure  to  gu^de  the  choice  of  q^.  Because  a  program 

only  assigns  terms  to  variables,  although  perhaps  in  quite  complex  ways,  the 

choice  of  is  entirely  one  of  finding  a  program  that  gives  values  to  the 

variables  which  force  ~IR  a  F  to  be  inconsistent  and  hence  make  R  true.  Thus,  we 

wish  to  find  a  program  that  unifies  the  literals  in  a  set  of  expressions,  that 

is,  our  program  must  simulate  the  most  general  unifier  of  the  resolution  principle. 

Because  the  most  general  unifier  is  readily  calculable,  it  serves  as  a  basis  for 

the  choice  of  a  •  Unification  defines  the  variables  which  must  be  altered  and  even 
n 

the  manner  in  which  they  should  be  altered  in  order  to  prove  7Rn  A  F  inconsistent. 
Thus,  a  natural  procedure  arises  for  the  choice  of 

This  simulation  of  unification  in  the  choice  of  programs  is  not  arbitrary. 
Resolution  is  a  special  case  of  the  problem  solving  environment  described  herein. 

In  Resolution,  there  is  a  set  of  facts  and  a  goal  R,  to  be  shown  as  a  theorem 
of  these  facts.  Resolution  operates  by  literal  elimination,  or  attempting  to 
show  R  directly,  and  appropriate  unification,  or  assignment  of  terms  to  variables. 
Resolution  is  the  special  case  where  all  programs,  or  assignments  of  terms  to 
variables,  are  permitted.  In  this  Resolution  environment,  the  most  general 
unifier  serves  as  a  good  method  of  proving  R.  When  we  restrict  the  set  of 
valid  programs  and  move  to  the  problem  solving  environment  of  this  paper, 
naturally  the  best  choice  of  programs  will  emerge  from  simulating  the  unconstrained, 
resolution  situation.  Thus,  the  solution  method  of  this  paper  yields  a  model  of 
problem  solving  in  a  more  general  environment  where  Resolution  is  a  special  case. 
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The  choice  of  A  emerges  from  the  Resolution  model  in  a  similar  fashion.  A 
n  n 

is  simply  the  expression  which,  when  unified,  proves  a  contradiction,  or  comes  as 

close  as  can  be  to  doing  so.  That  is,  A  is  the  set  of  expressions  the  Resolution 

n 

principle  would  choose  to  unify  and  solve  the  problem  with. 

The  simulation  of  Resolution  can  be  considered  inefficient  in  the  sense  that  it 

provides  no  information  toward  avoiding  blind  alleys.  Ihe  wp  predicates  yield  some 
information,  but  unless  there  is  a  means  of  evaluating  where  the  weakest  pre¬ 
condition,  wp (<y,  R)  is  closer  to  the  initial  state  than  R  itself,  we  will  have  no 
way  of  knowing  if  »e  areheaded  in  the  correct  direction.  As  a  result,  we  will  define 
such  a  measure  of  "closeness  to  the  goal"  in  the  section  on  pseudometrics,  which 
will  complete  the  problem  solving  system  of  this  paper. 

As  described,  this  system  contains  no  backtracking.  The  skeleton  of  a 

backtracking  system  will  be  discussed,  with  the  details  left  to  the  reader. 

Suppose  a  level  n  is  reached  so  that  no  program  eliminates  a  literal  from 

A  .  Then  we  should  backtrack  to  level  tv-1  before  <y  ,  is  applied  and 
n  n-1 

use  the  next  best  program  a  instead  of  a  and  then  continue  with  the 

n- 1  n-1 

program  synthesis.  If  at  some  point  this  precipitates  backtracking  to 

level  n*o,  we  can  either  choose  a  new  difference  statement  A',  or  we  can 

o’ 

begin  looking  at  pairs  (then  triples,  etc)  of  programs  o^B  to  negate  A^. 

By  observation  8,  such  a  method,  as  it  would  eventually  check  all  conjunctions  of 
base  programs,  will  be  complete.  This  means  if  a  program  can  be  found  to 
satisfy  the  goal,  it  will  be  found.  As  we  mentioned,  if  there  is  no  solution 
then  this  fact  will  often  be  undecidable. 

It  is  probable  that  a  combination  of  lemmasll  and  12  would  provide 

♦ 

the  optimal  search  plan.  The  general  difficulty  in  finding  a  B  so  that  S 

A  -♦  wp  (8,  R) ,  coupled  with  the  question  of  which  route  (11  or  12)  to 

i 

take  at  any  juncture  prevents  this  from  being  developed  here.  It  is  a  j 

good  topic  for  future  investigation  in  this  area.  ’ 

J 
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III. 


3  Comparison  to  The  Resolution  Principle 

The  final  observation  of  this  section  is  that  in  an  environment  with 
only  the  unification  program,  the  system  devolves  to  be  a 
modification  of  resolution.  First  note  that  in  the  absence  of 
other  programs,  we  cannot  change  states.  Therefore  there  are  no 
state  dependent  facts .  In  order  to  prove  an  assertion  R,  the  initial  step 
was  to  operate  resolution  and  then  exit  with  either  a  contradiction 
and  success,  or  a  minimal  assertion  Aq  and  a  set  of  state  dependent  facts. 

As  noted,  this  set  is  empty,  so  we  have  only  the  assertion  A  . 

Since  there  is  only  the  unification  program  ,  we  set  up  a 
new  goal  wp  (au»  R  V  7A^)  *  R  V  TA  and  attempt  to  prove  that. 

This  is  equivalent  to  choosing  a  conjunct  of  the  set  of  theorems  proved 

by  resolution  and  specifically  trying  to  negate  it,  as  opposed 

to  generally  searching  for  a  contradiction.  If  we  fail  to  prove  7Aq 

(and  thus  R)  with  resolution,  we  take  a  second  difference  A^  and  try  to 

prove  R  V  TA^  v  1A^.  This  will  continue  until  either  we  prove  a  contradiction 

or  we  get  a  new  conjunct  A^.  Because  we  will  be  taking  progressively  more 

minimal  statements  A^,  we  are  imitating  resolution,  only  with  a  focus. 

We  focus  in  on  statements  that  seem  the  easiest  to  contradict,  as  opposed 
to  generally  resolving  all  the  theorems  of  resolution.  With  backtracking, 

(the  choosing  of  a  different  Aq) ,  this  method  Is  approximately  equivalent 


to  the  Resolution  Principle. 
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IV.  .1  Comparison  to  STRIPS 

The  GC  problem  solver  described  shares  many  features  with  STRIPS.  As 
with  the  GC  problem  solver,  STRIPS  intially  operates  resolution  to  try  to 
prove  its  goal.  If  this  falls,  it  chooses  a  theorem  T  resulting  from 
the  resolution  and  tries  to  prove  it.  The  theorem  T  corresponds  to  A  a,  B  a.  .  .B 

1  K 

In  our  model.  It  is  a  result  of  the  proof  which,  if  made  true,  forces 
the  goal  to  be  true.  STRIPS  then  searches  for  a  satisfactory  operator,  as 
does  the  GC  problem  solver.  Given  this  operator,  it  again  tries  to 
prove  the  new  goal.  If  this  fails,  it  extracts  the  difference  of  the  new 
goal  and  the  current  state,  and  continues.  This  is  precisely  what  the 
GC  problem  solver  does,  albeit  in  a  more  formal  manner.  Consequently, 
in  a  STRIPS  problem  environment,  the  GC  solver  simulates  STRIPS. 

However,  there  are  several  differences.  The  first  major  difference  is 
that  the  GC  problem  solver  has  the  ability  to  use  information  about  the 
result  of  applying  operators  without  actually  applying  them,  in  the  form 
of  the  weakest  precondition.  Ihis  allows  it  some  vision  as  to  what 
branch  is  promising  in  the  search  tree,  vision  that  is  denied  STRIPS. 

STRIPS  with  MACROPS  attempts  to  cope  with  this  lack  of  vision  by  remembering 

what  was  previously  a  successful  path,  however,  there  is  no  problem  in 
doing  the  similar  thing  with  the  GC  planner  and  STRIPS  still  lacks  any 
vision  concerning  the  application  of  concatenations  of  MACROPS.  Thus,  even  in  a 

STRIPS  environment,  the  GC  solver  can  check  whether  wp(o,  R''  is  harder  to  satisfy 

than  R,  and  hence  whether  to  apply  a- 

A  second  difference  regards  unification.  It  may  be  that  we  declare  some 
assignments  of  variables  illegal  and  wish  to  prevent  their  occurance. 

STRIPS  places  no  restrictions  on  unification  in  its  operation,  while 
such  limitations  are  possible  in  DL.  If  we  wish  to  rule  out  x*0  in  our 
system,  we  may  merely  prevent  any  operator  from  ever  assigning  0  to  x. 
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This  is  one  way  in  which  GC  has  a  wider  scope  than  STRIPS. 

Another  important  way  is  that  in  GC,  we  can  call  relevant  data 
(a  data  program  is  simply  x  «-  e,  e  being  the  data  on  variable  x)  with 
a  simple  program.  Moreover  we  have  programs  available  which  are  much 
more  complicated  than  STRIPS  operators. 

Consequently,  we  see  that  the  GC  problem  solver,  while  simulating 
STRIPS  within  a  STRIPS  environment,  can  calculate  the  results 
of  its  actions  before  it  takes  them,  allowing  it  a  modicum  of  "intuition" 
about  which  approach  to  take.  Moreover,  it  is  at  home  in  more  complicated 
environments  where  calculations  and  not  just  theorem  proving  must  be  done. 
Finally,  by  Lemma  2.12,  we  actually  were  able  to  prove  that  STRIPS  did 
what  it  claimed,  that  is,  if  it  found  its  subgoal  true,  then  the  sequence 
of  operators  did  make  the  goal  true. 

IV  4.2  Pseudometrics 

The  search  for  an  appropriate  program  to  apply  toward  a  goal  may  be 
directed  using  the  method  of  pseudometrics  [4]. 

A  pseudometric  is  a  distance  measure  over  a  space  X.  It  is  defined  similarly 
to  the  metric  of  real  analysis. 

Definition:  A  pseudometric  on  a  set  X  is  a  function  p:  X  x  X  -»  R+ 
satisfying 

i)  p  (x,x)  *  0  for  all  x€  X 

ii)  p (x ,x ' )  -  p(x’,x)  *  0  for  all  x,  x'  €  X 

iii)  p(x,  x')  +  p  (x',x")  2  p(x,x")  for  all  x,  x’ ,  x"  €  X 

The  difference  between  a  metric  and  a  pseudometric  is  that  we  may  have 
p(x,x')  ■  0  for  x  x'  if  p  is  a  pseudometric  but  not  if  p  is  a  metric. 
Because  X  will  be  a  space  of  models  in  our  application  and  we  do  not  wish  to 
distinguish  between  every  model  with  our  measure,  we  will  use  pseudometrics. 
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We  do  not  wish  to  distinguish  between  all  models  because  often  there  will 
be  characteristics  which  are  irrelevant.  If  we  do  wish  to  distinguish  all 
models,  then  metrics  are  relevant,  and  they  are  special  cases  of 
pseud ome tries . 

We  will  characterize  the  space  of  models  by  the  first  order  sentences 

that  are  true  in  the  model.  Let  S  , .  .  .,  S  be  a  finite  set  of  sentences, 

1  n 

and  q  ,  .  .  .,  cn  he  nonnegative  real  numbers  called  criticality  values. 

is  interpreted  to  be  the  "weight"  or  importance  of  the  it'1  sentence. 

Let  be  two  models.  A  clash  set  f is  defined  by 

’  jk 

r,.  =  { ±/ S  is  consistent  with  one  of  the  models  and  not  the  other}, 
jk  i 

A  pseudometric  p  is  then  defined  by 

p(Mj,  mS  -  I  Ci 

lefjk 

1  k 

for  any  models  MJ,  M  .  To  prove  p  is  a  pseudometric,  we  must  establish  that 

the  rules  iX  ii)  and  iii)  hold. 

1  k 

i)  if  M  =  M  ,  then  is  either  consistent  with  both  or  neither.  Therefore 

T..  -  0  and  p(MJ,  Mk)  -  Z  °i  =  Z  C*  =  0. 


jk 


ierjk  i€0 
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IV 


ii)  Observe  that  r,,  =  T,  ,  as  the  order  of  the  models  is  irrelevant. 

Jk  kj 


Iherefore 

ct  *  ^  ct  ■  P<Mk,  MJ) 


moreover,  since  c 
o 


kj  y( 

■t  *  0,  L 


*  o. 


iii)  Finally  given  M  ,  M  and  M  claim  r02  U  T12 


Proof  of  claim:  Suppose  i€  Tq2  •  is  either  consistent  with 


o  2  2  o 

M  and  not  with  M  ,  or  with  M  and  not  M  .  Suppose  the  former 


holds.  S^  is  either  consistent  with  M  or  not.  If  is  consistent 


1  2 
with  M  ,  we  have  that  it  is  inconsistent  with  M  and  thus  i^r^.  Otherwise 


its  inconsistent  with  and  not  with  M°,  and  consequently  if  Fm •  Hie 


01 

argument  where  S.  is  inconsistent  with  M°  and  consistent  with  is 


symmetric.  Therefore  i€  rQ2  implies  i C  Tq^  U  F12»  and 


r02  C  r01  U  ri2’  38  desired‘ 


Therefore  p  (M°,  M1)  +  p  (M1,  M2)  =  V  CjL  +  CjL  *  ]T  CjL 


y  c  +  s  c 

Li  ^  i 


1€r01  i€r!2  i€r01  U  r!2 


J  c1-  p(M°,  M2) ,  and 


i€r02  1€(rol  U  r!2)/r02  1€T02 


consequently  the  triangle  inequality  holds. 

Thus  we  have  shown  that  the  construction  of  p  satisfies  the  properties  of 


a  pseudometric.  Thus  p  is  a  distance  measure  over  the  space  of  models.  The 


distance  of  M°  and  as  measured  by  p  represents  the  clash  of  those  models  with 


respect  to  the  sentences  S^,.  .  . ,  and  the  weights  c^  associated  with  them. 
The  optimal  use  of  p  requires  the  appropriate  choice  of  and  c^.  We  will  dis¬ 
cuss  this  in  the  next  section. 

>3  Directing  a  problem  solver 

Suppose  we  have  a  goal  model  in  mind  and  a  set  of  sentences  S 
which  are  the  criteria  by  which  we  judge  a  model.  Further  suppose  the  weights 


.0  „1 


c  .  c  are  given.  If  M  ,  M  are  two  achievable  models,  and  if 
1  ’  n 


~Z  %  '  «• 


p(M°,MG  1  >  pCM^.Mg  ),  then  by  the  criteria  of  the  pseudometric,  is  closer  to 

M  than  M°  is.  This  is  the  sense  in  which  we  mean  a  pseudometric  can  guide  a 

problem  solver.  STRIPS  implicitly  assumes  a  pseudometric  where  c  =  1  for  all 

i  S  n.  The  sentences  S^  are  the  negations  of  the  assertions  in  the  goal  model. 

This  is  easily  seen.  When  STRIPS  checks  if  a  model  M  is  close  to  a  goal 

model  M  ,  it  checks  whether  the  assertions  of  the  goal  model  are  proved  in 

M.  This  is  equivalent  to  the  negations  of  the  assertions  of  the  goal 

model  being  inconsistent  with  M,  since  the  negation  of  an 

assertion  of  a  goal  model  is  inconsistent  with  the  goal  model.  This  is 

equivalent  to  checking  if  I  £  1%  as  i  £  T  if  S  is  inconsistent  with  M  (and  is 

inconsistent  with  M  ,  S  being  the  negation  of  an  assertion  of  M  ).  Since 

Cj  i  (j 

STRIPS  does  not  differentiate  the  weight  of  the  different  sentences,  we  may 
assume  c^  “  1.  In  passing,  we  note  that  ABSTRIPS  [  6 3  assigns  different  weights 
to  the  sentences  and  gains  some  direction  as  a  result  [4^. 

The  notion  of  inconsistency  applied  is  derived  from  standard  proof 
theory,  using  modus  ponens  and  universal  generalization.  In  particular,  the 
Resolution  Principle  would  yield  the  same  notion  . 

It  is  the  choice  of  and  c,,  that  concerns  us  here.  First  we  identify 
the  Recall  that  we  have  a  subgoal  (goal)  R  which  we  would  like  to  use 

programs  to  make  true  in  the  initial  state.  That  is,  we  would  like  R  to  be 

true,  or  to  find  a  program  0  so  that  wp  (a,  R)  is  true,  in  the  goal  model. 

If  the  model  M  is  a  true  representation  of  the  state  of  the  world,  then  we  would 

like  p  (wp(0,R),  M)  =  0  to  mean  wp  (0,R)  is  true  in  the  initial  state  M.  Thus 

we  want  no  assertion  of  wp  (a,R)  to  be  Inconsistent  with  M.  Consequently  let 

Sl» •  .  •,  be  the  assertions  of  M.  Then  p(wp(g,R),M)  =  0  if  and  when 

wp  (cr,R)  is  not  inconsistent  with  M.  Therefore,  we  wish  to 

let  Sr.  .  . , be  the  assertions  of  the  initial  state,  the  facts  that  are 

known  to  be  true. 

The  choice  of  c^'s  are  a  more  difficult  problem.  However,  there  are  some 


heuristics  to  consider  in  their  choice. 
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First,  if  there  is  a  program  B.  so  that  for  almost  any  R,  S.  is  consistent 

*  X/  1 

with  wp(B,»  R) ,  then  c  should  be  low.  This  is  because  we  can  apply  g 
*  1  l 

with  low  expected  complications  so  that  becomes  only  a  minor  problem.  On 
the  other  side,  if  most  of  the  programs  8  are  such  that  is  inconsistent 
with  wp(0,R)  for  most  R,  then  should  be  large.  This  follows  because 


is  determined  by  what  complications  it  presents  in  the  truth  of  wp(g,  R) 

The  best  choice  of  the  criticality  values  depends  on  the  problem 

environment.  Obviously,  the  relative  importance  of  aspects  of  a  problem 
is  a  subjective  issue,  and  developing  a  more  formal  theory  is  a  topic  for 
future  research. 

IV.  .4  Application  of  a  pseudometric 

Once  a  pseudometric  has  been  constructed,  we  may  use  it  to  direct  the 
choice  of  which  program  to  apply  next.  In  particular,  we  may  define  a  greedy 
strategy,  one  that  chooses  the  maximum  decrease  in  the  pseudometric  value 
with  each  program  application.  If  backtracking  is  available,  this  is  perhaps 
as  good  a  method  as  any.  To  use  the  pseudometric  to  guide  the  program  search, 
we  must  use  it  as  the  criteria  to  judge  which  program  to  choose.  This 
requires  evaluating  the  metric  at  each  time  a  candidate  program  is  proposed  in 
the  manner  described.  Once  such  a  metric  is  set  up,  it  is  a  tremendous  asset, 
as  it  defines  the  notion  of  distance  to  a  solution.  It  should  be  emphasized 
that  much  more  general  pseudometrics  may  be  instituted  [see  example  in  McAfee 
&  Whinston  1980]  than  the  one  described  in  this  paper.  However,  a  formal  notion 
of  distance  to  a  goal  is  a  pseudometric,  and  if  a  machine  is  ever  to  act  as  if 
it  knew  what  'close'  means,  it  must  have  a  measure  of  distance. 

For  computers  to  solve  general  problems,  it  is  likely  that  they  will  have  to 
simulate  human  decision  making.  Despite  vast  computational  power  and  extreme 
speed  per  computation,  computer  solutions  to  general  problems  are  still 
unacceptably  slow.  It  is  thus  of  interest  that  the  procedure  described  in 
this  paper  is  analogous  to  a  human  method  of  problem  solution,  at  least  in 
the  abstract.  The  procedures  first  checks  if  there  is  anything  to  be  done,  that 
is,  if  the  goal  Is  true  currently.  Its  next  step  is  to  consider  the  possible 
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alternatives.  Analogous  to  a  person,  it  checks  if  the  alternative  (program) 
does  something  potentially  useful,  by  means  of  its  pseudometric  intuition, 
and  whether  the  preconditions  of  the  alternative  are  reasonable,  using  the 
weakest  preconditions  of  Dijkstra.  Backtracking  gives  us  the  generality 
of  these  methods,  and  the  pseudometric  gives  us  the  means  of  telling  a  computer 
what  is  reasonable  and  what  is  not.  As  a  result,  the  methods  of  this  paper 
provide  a  new  approach  to  automated  problem  solving. 

V.  Example 

The  example  we  provide  is  similar  to  that  of  artificial  intelligence 
literature.  We  have  a  robot  in  a  2  room  suite.  The  robot's  position 
POSITION  (x,y)  is  described  by  the  variables  (x,y).  There  is  a  window  which 
is  open  when  V  =  1  and  shut  if  V  =»  -1.  The  variable  V  is  associated  with 
the  predicate  "WINDOW  (V)."  There  is  a  door  between  the  rooms,  which  can  be 
open  l  -  1  or  shut  l  -  -1,  and  i  is  associated  with  the  predicate  OPEN  (i) .  There 
are  3  operators  given  below. 


0  :  OPENWINDOW  Precondition: 

Addition: 

Deletion: 


POSITION  (0,2) 
WINDOW  (1) 
WINDOW  (-1) 


THRUD00R 

Preconditions : 

OPEN  (1) 

A  POSITION  (2,1) 

Addition: 

POSITION 

(4,1) 

Deletion: 

POSITION 

(2,1) 

03:  INROOMMOVETO 

Preconditions: 

CSJ 

CM 

K 

Addition: 

Deletion 

POSITION  ^ ,y^ )a[ (x^Ax^Mx^ax^)  ] 
POSITION  (x2,y2) 

POSITION  (x^) 
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The  corresponding  GC  programs  are 

B1:  (x-0  A  y-2)?;  V  «-  1  U  ”l(x=0  A  y-  2)?;  ABORT 

a  x=2  A  y=l)?;  x  «-  4;  y  1  |J  A  x=2  A  y*l)?;  ABORT 

C(x<3  A  x2<3)  V  (x>3  A  x2>3)]?  x  «-  y  ♦*  LH[  (xs3  A  x2«3)  v  (x>3  a  x2>3)  '  ■  ABORT 

A 

We  denote  the  state  changing  parts  of  by  0^  i.e., 

0t  =  V  f.  1,  02  =  X  «-  4;  y  «-  1,  03  s  x  «-  w;  y  4-  z. 

Our  goal  for  this  example  Is  the  following  model  Rq  : 

POSITION  (6,0)  A  WINDCW  (1) 
or,  in  DL  notation,  x  =  6,  y  =  0,  V  =  1. 

Our  initial  state,  Mj,  in  this  example  is: 

POSITION  (0,4)  A  WINDCW  (-1)  A  OPEN  (1) 

That  is,  x  *  0,  y  =  4,  V  =  -1  and  i  =  1. 

According  to  the  method  of  this  paper,  to  calculate  the  pseudometric  we 

take  the  sentences  of  the  initial  model,  negate  them  to  form  the  critical 

sentences  S, ,  .  .  . ,  S  ,  and  assign  them  criticality  values  c* .  .  . ,  c  .  We  shall 
1  n  in 

do  this  with  the  sentences  V  =  -1  and  l  -  1,  however,  we  shall  introduce  an 
interesting  twist  on  the  sentence  x  =  0  a  y  =4  .  Essentially  this  will  involve 
using  the  broken  euclidean  distance  metric  as  the  measure  of  distance. 

Let  Sj  be  ~I(V  =  -1)  and  S2  be  ")(f  =  1).  Let  the  criticality  values  associated  with 
and  S2  both  be  unity.  In  addition,  define  c^  (x^,  y^,  x2>  y2) 


c3  (Xj,  yr  x2,  y2)  -  4)2  +  (yL  - 


-  X0)2  +  (y,  -  y0)2  If  (x,^3  A  x0S3)  V  (x,>3  A  x0>3) 


1) 


r 

if 


(Xj>3  a  x2<3) 


+2  +  \l^2  -  2) 2  +  (y2  -  l)2 


It  is  readily  verifies  the c  gives  the  broken  Euclidean  distance  metric  for 

the  distance  between  (x^,  y^)  and  (x2>  y2).  (see  diagraa  2) 

1  2 

Now  we  define,  for  two  models  M  and  M 
pCM1,  M2)  *  +  c3(x1>y1,  x2>  y2) 

1Cri2 
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Observe,  as  V  is  disagreed  on 

pCMj,  Rq)  s  ^  ci+c3=c1+  c3^°»  4»  6*  0) 

ier _  ______ 

-  1  +\/(6-4)2  +  (0-1)  2  +  2  +l/(0  -2)2  +  <4-l>2 

-  3  +  V5~+  ^13. 


Any  STRIPS  type  program  0  has  a  special  form,  which  is  given  by  P?;p  U  1P?;ABORT. 
For  any  sentence  S,  wp(p,S)  also  has  a  special  equivalent  form,  as  we  now  show. 

a 

wp(B,S)  s  wp(P?  ;0  U  -1P7AB0RT,  S)  = 

(P-*wp(p,  S))  A  (IP  -»  wp (ABORT,  S))  = 

(TP  V  wp(B,S))  a  (P  V  false)  =  (TP  V  wp(B,S))  a  P  s 

A 

P  A  wp(B,S) . 

We  shall  use  this  simple  reduction  throughout  the  example. 


Let  B^  be  our  first  candidate  program.  It  is  a  natural  choice  as  the  preconditions 
of  and  B2  are  not  satisfied.  The  algorithm  yields: 
r  =  wp(B3,  Rq)  s  [6c  <3  A  x2  s  3)  V  (x  >  3  A  x2  >  3)]  A  wp(03,  RQ) 

».  C(6  <  3  A  x2  S  3)  V  (6  >  3  a  x2  >  3)]  a  wp(x  »-  x2;  y  «-  y 2>  RQ) 

s  [x2  >  3]  A  POSITION  (x2,  y2)  A  WINDOW  (1),  as  6  *  3. 

p(Mr  Rt>  »^Ci  +V(0-2)2  +  (4- 1) 2  +  2  +  \/( 4-x2)2  +  (1-y,,)2. 

minimizing  p(Mj.,  R^)  with  respect  to  x2>  y2  yields  x2  =  4  and  y2  =  1 

and  R^  *  Position  (4,  1)  a  WINDOW  (1) 

and  hence  p(Mj,  R^  *  3  +  \/T3,  a  distinct  gain. 

The  preconditions  of  0^  are  not  satisfied,  and  the  use  of  B3  will  not  provide 
any  reduction  of  p,  so  we  let  B2  be  our  next  candidate  program.  Then  we  have: 

A 

R2  S  wp(B2,  R S  (i  -  1  A  X  *  4  a  y  -  1)  A  wp(B2,  Rt> 

(X  .  2  A  y  -  1>  A  wp  (x  4-  4;  y  «-  1,  POSIHON  (4,  l)  A  WINDCW  (1)) 

s  POSITION  (2,  1)  A  WINDOW  (1).  We  have 

p(M1>  R2)  -  e  +  c3  -  1  +  V(0-2)2  +  (4- 1  > 2  -  1  +  /l3, 

again  an  improvement. 
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The  next  program  we  check  is  8^* 

R3  *  wp  (B3,R2)  s  x  *  x2  a  y  =  y2  a  x  <  3  a  V  =  1 
p(R3,  Mj)  »  1  +Y(0-x2)2  +  (4-y2)2  =  1  +\fC22  +  (4-y2)2 
we  could  minimize  this,  but  it  would  not  deal  with  opening  the 
window.  Consequently  the  machine  would  make  an  error  here  (going  to 
(0,4)).  At  this  point  p  ®  1  and  the  machine  would  attempt  to  open  the 
window  by  applying  the  operator  0^.  This  reveals  the  position  problem  as 
we  see 

R^  =  wp(83,R3)  s(x*0Ay  =  2)A  wp^.R^ 

h  (x  =■  0  a  y  =  2)  a  (x*x2Ay=y2)  a  V  =  -l 
=x=0Ay=2AV«-l  Thus  the  wp's  direct  x2  '■  0  a  y2  “  2 
p(R^,  M^)  =>  Vfo  -  0)2  +  (2  -  4) 2  =  2,  an  improvement. 

Finally,  we  operate  0^  once  more 

R5  s  (b3>  R^)  s  x  =  x3  a  y  =  y3  A  wp(03,  R^ ) 

=  x  =  x^  a  y  =  y3  A  [x  =  0  a  y  =  2  A  x  <  3]  A  (V=  -1) 

?  x”x3Ay=y3Axs3AV=-l 
Minimizing  p(R3>  Mj)  =  V(0-x3)2  +  (4  -  y3)2 
subject  to  x3  <  3 

yields  x3  ®  0, y3  *  4  and  p(Rj  M^)  =0. 

p(Rj»  Mj)  *  0  means  the  problem  is  solved  by  the  program  sequence 

(b3;  Bj ;  b3;  b2;  e3) • 

There  are  two  significant  characteristics  displayed  by  this  example. 
First,  the  notion  of  a  pseudometric  admits  such  measures  as  Euclidean  dis¬ 
tance.  One  is  not  restricted  to  such  pseudometrics  as  discussed  in 

M 

section  4,  and  we  may  use  ordinary  measures  to  measure  the  difference  of 


models . 

Perhaps  more  important,  the  pseudometric  may  be  used  to  guide 
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the  choice  of  operators  where  an  operator  is  a  schema.  In  both  the  first 
and  third  application  of  B^.  we  used  minimization  of  the  pseudometric 
as  the  choice  function  for  the  values  (x^y^  )»  the  point  from  which  we 
were  moving.  As  we  saw  in  the  second  application  of  B^>  this  is  not  an 
ideal  guide  (because  it  tells  the  machine  to  make  easy  gains)  but  it  is  a 
guide  to  the  choice  of  values  for  a  schema.  Used  in  conjunction  with  the 
preconditions  of  the  next  program,  it  becomes  a  very  useful  guide. 

Backtracking  is  employed  to  handle  cases  where  it  fails  to  guide  impeccably. 

One  final  point,  some  pseudometrics  arise  naturally  on  a  space  of 
models.  In  our  example,  the  distance  measure  c^  is  an  appropriate 
measure  due  to  considerations  resulting  from  Euclidean  space,  and  the 
existance  of  a  barrier.  Others,  such  as  the  measure  of  the  value  of  a 
window  discrepancy,  are  arbitrary.  Their  value  may  rest  on  the  judgement 
of  the  individuals  who  initially  describe  the  program. 

In  this  example  we  chose  the  programs  so  that  they  worked.  However, 
if  we  are  faced  with  a  problem  not  contrived  as  an  example,  we  could 
use  the  pseudometric  as  a  guide  to  problem  solution.  If  the  other 
programs  were  tried  and  then  checked  via  the  pseudometric  for 
improvement,  we  would  find  that  generally  the  best  program  would  also 
minimize  the  pseudometric.  This  occurs  because  the  pseudometric  values 
were  improvements  and  the  best  program  generally  results  in  improvement. 

VI.  Conclusion  and  research  topics 

In  this  paper,  several  extensions  of  problem  solvers  are  examined. 

Specifically,  we  looked  at  problem  solving  in  a  larger  language  than 

first  order  calculus,  one  that  incorporates  the  programs  which  form  the 

nucleus  of  problem  solving  systems.  Next,  a  method  of  enbedding  a  portion 

of  this  language  in  the  first  order  calculus  was  examined.  We  proposed  a  problem 

solving  system  that  extends  STRIPS  and  the  Resolution  Principle.  We  then  formalized 

the  notion  of  distance  in  models  and  showed  how  this  may  be  used  to  direct  the  search  of 

i  _ 
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the  machine  for  a  solution. 

Perhaps  the  most  important  research  topic  is  the  inclusion  of  program 
union  and  iteration  in  the  program  synthesis  algorithm.  These  are 
important  methods  of  combining  programs,  and  present  some  complications 
in  their  inclusion.  Lemma  2.11  shows  how  union  may  be  profitably  included. 
However,  the  correct  use  of  this  Lemma  is  difficult  to  incorporate  into  the 
algorithm.  Iteration  presents  a  tricker  problem,  as  the  weakest  pre¬ 
condition  is  an  infinite  disjunction.  Some  promising  research 
suggests  that  iterations  could  be  called  for  when  induction  is  called 
for.  However,  the  problem  of  when  to  use  it  still  remains. 

With  regard  to  pseudometrics,  an  important  line  of  inquiry  is  the 
optimal  utilization  of  the  measure  and  minimizing  the  cost  of  evaluation 
of  the  pseudometric.  Moreover  the  choice  of  the  criticality  values  still 
remains  open.  Effective  use  of  pseudometrics,  and  indeed,  efficient 
decision  procedures,  requires  additional  research  on  this  topic. 
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